METHOD FOR SEQUENCING NUCLEIC ACIDS USING MATRIX-ASSISTED LASER DESORPTION
IONIZATION TIME-OF-FLIGHT MASS SPECTROMETRY

BACKGROUND OF THE INVENTION 

Field of the Invention 

The invention relates to the determination of nucleotide sequences for nucleic 
acids and their analogs. 

Brief Summary of the Related Art

Determination of nucleotide sequences for various nucleic acids has become a 
fundamentally important analytical step for numerous molecular biology and 
biomedical applications. Consequently, a variety of methods have been developed 
to facilitate such nucleotide sequence determinations. Maxam and Gilbert, Proc.
Natl. Acad. Sci. USA 74: 5460 (1977) discloses a chemical degradation approach for
DNA sequence determination. Sanger et al., Proc. Natl. Acad. Sci. USA 74: 5463
(1977), discloses a chain termination method using complementary strand primer 
extension to determine DNA sequences. Each of these methods utilizes four 
separate reaction mixtures to create a nested set of fragments differing by a single
nucleotide in length and representing a complete nucleotide sequence, followed by
resolution of the fragments based on their size to determine the order of the 
fragments and hence the nucleotide sequence. Both of these procedures take from 
numerous hours to days to perform, and neither is effective for determining the 
nucleotide sequence for certain analogs of DNA. 

These popular approaches to nucleotide sequence determination are limited
by their time-consuming methodologies and by their inapplicability to certain types
of nucleic acid analytes. For example, small synthetic oligonucleotides have recently 
become of interest as tools in molecular biology experiments, as well as for use in 
the antisense therapeutic approach to disease treatment. Correct sequences are
necessary to the efficacy and safety of such oligonucleotides, and effective and rapid
analytical approaches are needed for quality control. This class of compounds
presents three special problems for traditional sequence determination approaches. 
First, quality control procedures are needed which are more rapid than the 
traditional approaches. Second, the oligonucleotides are generally short, often in the 
range of from about 15 to about 35 nucleotides in length. As a consequence, 
traditional sequencing approaches result in the loss of much sequence information 
and thus provide only incomplete information about the nucleotide sequence of the 
oligonucleotides. Finally, many of these oligonucleotides have either modified 
internucleoside linkages or substitution at the 2' position of the ribose to improve
their properties as potential therapeutic agents. U.S. Patent No. 5,220,007 discloses 
chimeric oligonucleotides having regions of oligonucleoside phosphodiester or 
phosphorothioate alongside regions of oligonucleoside alkylphosphonate or 
phosphoramidate. PCT publication WO94/02498 discloses hybrid oligonucleotides 
having DNA regions alongside 2'-O-substituted RNA regions. Uhlmann and Peyman,
Chemical Reviews 90: 544 (1990), discloses oligonucleotides having a variety of 
modifications along the internucleoside linkages, sugar residues or nucleoside bases. 

Recently, some approaches have been developed to address these special 
problems. U.S. Patent No. 5,403,709 discloses a method for sequencing 
oligonucleotides using another oligonucleotide as an extension and a third, bridging 
oligonucleotide to hold the first two together for ligation. Conventional primer 
extension is then used to create a complement for sequencing. This approach 
requires some advance knowledge of a portion of the sequence of the analyte 
oligonucleotide. U.S. Patent No. 5,525,470 discloses a similar approach which avoids
the need for such advance knowledge by utilizing RNA ligase to couple the analyte 
and extension oligonucleotides. Both of these approaches can be used with some 
modified oligonucleotides, especially with those having phosphodiester or 
phosphorothioate internucleotide linkages. Both, however, are time-consuming and 
limited to analyte oligonucleotides which can act as templates for the polymerase 
enzyme used to synthesize their complement.

Recently, Nordhoff et al., J. Mass Spec. 30: 99 (1995) disclosed a direct
sequencing method for oligonucleotides using infrared matrix-assisted laser 
desorption/ionization mass spectrometry. This method utilized time-of-flight 
analysis of prompt fragment ions generated by infrared laser desorption/ionization. 
Fragments were reported to be generated by consecutive cleavage of the
deoxyribose phosphate backbone at apurinic sites. Unfortunately, such cleavage
does not occur at T residues, due to failure to remove the thymidine base, and may 
not occur in RNA due to stabilization of the glycosidic bond. In addition, there is a 
problem with the prompt fragment ions overlapping with "post-source decay"
fragment ions (PSD ions) from metastable decays occurring in the field-free region
of the time-of-flight analyzer. 

Brown and Lennon, Anal. Chem. 67: 3990 (1995) discloses sequence-specific
fragmentation of matrix-assisted laser-desorbed protein/peptide ions and detection 
of the fragments using time-of-flight mass spectrometry with delayed pulsed ion
extraction. The delayed pulsed ion extraction is used to reduce the generation of
PSD ions by expanding the desorbed neutral plume during the extraction delay 
period, thereby avoiding energetic collisions believed to play a role in the 
generation of PSD. The technique was found to be applicable to small peptides and 
one special case larger protein. 

There remains a need for more rapid approaches for determining the
nucleotide sequence of nucleic acid analytes. In addition, there is a need for new
sequencing approaches that address the special sequence determination problems 
presented by synthetic oligonucleotides. Ideally, such a new approach should be 
rapid and universal, should act directly on the nucleic acid analyte, and should be
applicable to any chemically modified oligonucleotides.

BRIEF SUMMARY OF THE INVENTION

The invention provides a universal analytical method for determining the 
nucleotide sequence of nucleic acid analytes, including any chemically modified 
oligonucleotides. This new method utilizes matrix-assisted laser 
desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS) with
delayed pulsed ion extraction. This method is extremely rapid and acts directly on 
the nucleic acid analyte, creating in-source fragmentation (ISF) products which are 
used to determine the sequence of the nucleic acid analyte. Such ISF products have 
earlier also been known as prompt fragmentation ions, and such terms can be used 
interchangeably although in-source fragmentation product is the currently preferred 
term and will be used throughout herein. It is effective for a variety of nucleic acid 
analytes, including any chemically modified oligonucleotides which have not 
previously been successfully sequenced. 

The invention provides a method for determining the nucleotide sequence of 
nucleic acid analytes comprising providing a suitable target for matrix-assisted laser 
desorption/ionization, such target comprising a nucleic acid analyte in a matrix 
suitable for such matrix-assisted laser desorption/ionization; irradiating the target 
with laser irradiation suitable for creating ISF products from the nucleic acid 
analyte; delaying pulsed ion extraction for a time sufficient to reduce formation of 
PSD ion fragments; ordering the ISF products according to molecular weight by 
time-of-flight mass spectrometry; and determining the nucleotide sequence of the
nucleic acid analyte by comparing the molecular weights of the ordered ISF 
products. 

Preferably, the suitable target will include a solid support, such as the 
commercially available solid supports provided by Bruker Daltonics, formerly 
Bruker Analytical Systems (Billerica, MA) or PerSeptive Biosystems (Cambridge, 
MA). Generally, the target is prepared by applying to the solid support a solution 
comprising the nucleic acid analyte and a matrix suitable for matrix-assisted laser 
desorption/ionization (MALDI), then allowing the solution on the solid support to 
evaporate to dryness. The nucleic acid analyte can be a naturally occurring or 
synthetic polynucleotide or oligonucleotide, including oligonucleotides having
chemically modified internucleoside linkages, sugar backbones or nucleoside bases.

The suitable matrix can be selected from the matrices known in the art to be
suitable for MALDI, and includes a cation exchange resin, preferably in ammonium
form. Preferred matrices include, without limitation, 3-hydroxypicolinic acid,
3-hydroxypicolinic acid/picolinic acid, 3-hydroxypicolinic acid/sucrose,
2,5-dihydroxybenzoic acid, sinapinic acid, alpha-cyano-4-hydroxy-cinnamic acid,
anthranilic acid, 6-aza-2-thiothymine, nicotinic acid, 2,4,6-trihydroxybenzoic acids,
2,4,6-trihydroxyacetophenone and combinations of the same. Other suitable
matrices are identified in Fitzgerald et al., Anal. Chem. 65: 3204 (1993). In one
particularly preferred embodiment, the matrix is 3-hydroxypicolinic
acid/N-(3-indolylacetyl)-L-leucine (3-HPA/IAL).

The target can be irradiated with any of a variety of laser irradiation sources. 
Preferred laser irradiation sources include, without limitation, UV lasers, visible
lasers and IR lasers. Particularly preferred UV lasers include a 337 nm N2 laser, a
266 nm laser, a 193 nm laser and a 355 nm laser. 

The pulsed ion extraction is delayed for a time sufficient to increase mass 
resolution of the ISF products and to reduce PSD ion formation sufficiently to 
improve the profile of the time-of-flight mass spec (TOFMS) by providing a voltage
pulse to deflect low mass ions and avoid suppression of signals of the larger ions,
thus resulting in discrete peaks for each species of ISF product. The appropriate 
time for the voltage pulse depends on the size of the largest ion species of interest 
and can conveniently be adjusted to maximize the mass resolution for such ion 
species. 

The ISF products are readily separated according to molecular weight by
TOFMS, which orders the species according to their molecular weight. Since the
mass of each nucleotide is known, this allows determination of the nucleotide 
sequence of the nucleic acid analyte by simple comparison of the molecular weights 
of the consecutive species in the ordered array.

BRIEF DESCRIPTION OF THE DRAWINGS



Figure 1 shows a TOFMS spectrum for nucleotides N to N-18 for a 20-mer
oligonucleotide sequenced using a preferred embodiment of the method according
to the invention. The "w" ions are marked and the molecular weight assignments
are shown over each peak.

Figure 2 shows a TOFMS spectrum for nucleotide N-19 for the same oligonucleotide
shown in Figure 1.

Figure 3 shows a TOFMS spectrum for a 15-mer PS all DNA oligonucleotide. The "w"
ions are marked and the molecular weight assignments are shown over each peak.

Figure 4. MALDI-TOF mass spectrum of a 17-mer PS/PO oligodeoxynucleotide
(ODN) chimera. In addition to "a-B" (marked by *), "w" and "d" ions, the "a" ions
are also present.

Figure 5. MALDI-TOF mass spectrum of a 25-mer PS ODN. 

Figure 6. MALDI-TOF mass spectrum of a 20-mer PS ODN with 2'-O-methylation
on the sugar units of the first two and last four nucleosides.

Figure 7. MALDI-TOF mass spectrum of an 18-mer PS/MP chimera.

Figure 8. RP-HPLC chromatograms of a neat 25-mer PS ODN (upper trace) and
a mixture of the 25-mer PS ODN and a 17-mer PS ODN (lower trace). The mixture 
was separated into 5 fractions as indicated.

Figure 9. MALDI-TOF mass spectra of 4 of the 5 fractions separated by RP-HPLC.

Figure 10. Proposed fragmentation pathway for modified oligonucleotides
(MONs) containing negatively charged backbone linkage. 

Figure 11. Proposed fragmentation pathway for modified oligonucleotides 
containing neutral backbone linkage.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The invention relates to the determination of nucleotide sequences for nucleic 
acids and their analogs. The patents and publications cited herein are known to 
those skilled in this field and are hereby incorporated by reference in their entirety.

The invention provides an analytical method for determining the nucleotide 
sequence of nucleic acid analytes, including any chemically modified 
oligonucleotides. This new method utilizes matrix-assisted laser 
desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS) with
delayed pulsed ion extraction. This method is extremely rapid and acts directly on
the nucleic acid analyte. It is effective for a variety of nucleic acid analytes, 
including any chemically modified oligonucleotides which have not previously been 
successfully sequenced. 

The invention provides a method for determining the nucleotide sequence of
nucleic acid analytes comprising providing a suitable target for matrix-assisted laser
desorption/ionization, such target comprising a nucleic acid analyte in a matrix 
suitable for such matrix-assisted laser desorption/ionization; irradiating the target 
with laser irradiation suitable for creating ISF products from the nucleic acid 
analyte; delaying pulsed ion extraction for a time sufficient to reduce formation of
PSD ion fragments; ordering the ISF products according to molecular weight by
time-of-flight mass spectrometry; and determining the nucleotide sequence of the 
nucleic acid analyte by comparing the molecular weights of the ordered ISF 
products. 

Preferably, the suitable target will include a solid support, such as the 
commercially available solid supports provided by Bruker Analytical Systems
(Billerica, MA) or PerSeptive Biosystems (Cambridge, MA). Preferred solid supports
include stainless steel and polymer coated metals. Generally, the target is prepared 
by applying to the solid support a solution comprising the nucleic acid analyte and 
a matrix suitable for matrix-assisted laser desorption/ionization (MALDI), then 
allowing the solution on the solid support to evaporate to dryness.

WO 98/03684 PCT/US97/12688

The nucleic acid analyte can be a naturally occurring or synthetic 
polynucleotide or oligonucleotide, including oligonucleotides having chemically 
modified internucleoside linkages, sugar backbones or nucleoside bases. For 
purposes of the invention, the term "oligonucleotide" includes polymers of two or 
more deoxyribonucleoside, ribonucleoside or 2'-O-substituted ribonucleoside
monomers, or any combination thereof. Preferably, such oligonucleotides will have
from about 2 to about 100 monomers, and most preferably from about 13 to about 
40. Such monomers may be coupled to each other by any of the numerous known 
internucleoside linkages. In certain preferred embodiments, these internucleoside
linkages may be phosphodiester (PO), phosphotriester, phosphorothioate (PS), or
phosphoramidate linkages, or combinations thereof. The term oligonucleotide also
encompasses such polymers having chemically modified bases or sugars and/or
having additional substituents, including without limitation lipophilic groups, 
intercalating agents, diamines and adamantane. For purposes of the invention the
term "2'-O-substituted" means substitution of the 2' position of the pentose moiety
with an -O-lower alkyl group containing 1-6 saturated or unsaturated carbon atoms,
or with an -O-aryl or allyl group having 2-6 carbon atoms, wherein such alkyl, aryl
or allyl group may be unsubstituted or may be substituted, e.g., with halo, hydroxy,
trifluoromethyl, cyano, nitro, acyl, acyloxy, alkoxy, carboxyl, carbalkoxyl, or amino
groups; or such 2' substitution may be with a hydroxy group (to produce a
ribonucleoside), an amino or a halo group, but not with a 2'-H group.

The suitable matrix can be selected from the matrices known in the art to be 
suitable for MALDI, and includes a cation exchange resin, preferably in ammonium 
form. In a preferred embodiment, the cation exchange resin is a 200-400 mesh in
ammonium form. Preferred matrices include, without limitation, 3-hydroxypicolinic
acid, 3-hydroxypicolinic acid/picolinic acid, 3-hydroxypicolinic acid/sucrose,
2,5-dihydroxybenzoic acid, sinapinic acid, alpha-cyano-4-hydroxy-cinnamic acid,
anthranilic acid, 6-aza-2-thiothymine, nicotinic acid, 2,4,6-trihydroxybenzoic acids,
2,4,6-trihydroxyacetophenone and combinations of the same. Other suitable
matrices are identified in Fitzgerald et al., Anal. Chem. 65: 3204 (1993). In one
particularly preferred embodiment, the matrix is 3-hydroxypicolinic
acid/N-(3-indolylacetyl)-L-leucine (3-HPA/IAL).

The target can be irradiated with any of a variety of laser irradiation sources. 
Preferred laser irradiation sources include, without limitation, UV lasers, visible 
lasers and IR lasers. Particularly preferred UV lasers include a 337 nm N2 laser, a
266 nm laser and a 193 nm laser. One particularly preferred laser irradiation source
is the 337 nm N2 laser provided with the MALDI unit produced by Bruker
Analytical Systems. The laser power is sufficient to efficiently produce ISF products
without saturating the detector system. To achieve this power level with the Bruker
MALDI unit, a laser attenuation level of from about 40 to about 50 is used to give
an irradiance of 1-2 x 10^7 W/cm^2. For other types of attenuation systems, attenuation
levels of from about 10 to about 110 may be advantageously used. The laser
attenuation level can be measured using a power meter (Laser Probe, Inc., Utica,
NY, USA). The ion source of the mass spectrometer is opened so that the energy
detector can be placed in the laser beam path in the ion source region. The
energy output of the laser is recorded at a series of the neutral density attenuator
settings. 

Several parameters may be automatically set on the MALDI-TOFMS unit, 
such as on the unit produced by Bruker Daltonics. Preferably, the voltage on the
target (acceleration voltage) will be from about +/- 3-30 kV (for positive or negative
polarity). In one particularly preferred embodiment, such acceleration voltage will 
be about +25 kV. In another particularly preferred embodiment, such acceleration 
voltage will be about +20 kV. The voltage on the auxiliary plate after switching (P2) 
is preferably from about +/- 2-27 kV. In one particularly preferred embodiment,
such P2 voltage is about +22.5 kV. The lens voltage is preferably from about +/- 0-15
kV. In one particularly preferred embodiment, such lens voltage is about +9.2
kV. The voltage on the dual-microchannel-plates detector is preferably from about 
+/- 1.3-2.0 kV. In one particularly preferred embodiment, such detector voltage is 
about +1.5 kV. The time elapsed between laser firing and voltage switching on the
auxiliary plate is preferably from about 100 to about 400 ns. In one particularly
preferred embodiment, such elapsed time is about 250 ns. The number of spectra
summed may preferably range from about 1 to about 1000. In one particularly 
preferred embodiment, such number will be about 80. The vacuum in the sample 
chamber region is preferably at or below 5 x 10"* torr. In one preferred 
embodiment, such vacuum is 5 x 10^-7 torr.

The pulsed ion extraction is delayed for a time sufficient to increase mass 
resolution of the ISF products, and to reduce PSD ion formation sufficiently to 
improve the profile of the time-of-flight mass spec (TOFMS) by providing a voltage 
pulse to deflect low mass ions and avoid suppression of signals of the larger ions,
thus resulting in discrete peaks for each of the ISF products. The MALDI-TOFMS
unit produced by Bruker is available with the MS modified as to allow delay of the 
pulsed ion extraction following laser irradiation. The appropriate voltage and time 
for the ion deflection voltage pulse depends on the size of the largest ion species of 
interest and can conveniently be adjusted to maximize the mass resolution for such
ion species. Preferably, such ion deflection voltage will range from about 0 to about
10 kV. In one particularly preferred embodiment, such ion deflection pulse will be 
about +2 kV. Preferably, such time will be from about 0-10 microseconds. In one 
particularly preferred embodiment, such time is about 1-2 microseconds. 

The ISF products are readily separated according to molecular weight by
TOFMS, which orders the products according to their molecular weight. The
observed ISF products include the "w" and "d" ions, and also the "a" and "z" ions.
As is understood in the art, the w and z ions are generated by fragmentation of the 
sequence from the 3' end and the a and d ions are generated by fragmentation of 
the sequence from the 5' end. Since the mass of each nucleotide is known, this
allows determination of the nucleotide sequence of the nucleic acid analyte by
simple comparison of the molecular weights of the consecutive species in the 
ordered array. 
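
The comparison of consecutive molecular weights described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the procedure of the examples: it assumes an unmodified phosphodiester DNA analyte and uses the commonly tabulated average residue masses for that case, and the function names, the tolerance and the ladder values are hypothetical. Modified linkages or sugars would require their own mass table.

```python
# Average residue masses (Da) of unmodified phosphodiester DNA nucleotides.
# Assumption: modified backbones or sugars need a different table.
RESIDUE_MASS = {"C": 289.18, "T": 304.20, "A": 313.21, "G": 329.21}


def assign_base(mass_diff, tolerance=1.0):
    """Return the base whose residue mass is closest to mass_diff, or None."""
    base, mass = min(RESIDUE_MASS.items(), key=lambda kv: abs(kv[1] - mass_diff))
    return base if abs(mass - mass_diff) <= tolerance else None


def read_ladder(mz_values, tolerance=1.0):
    """Order the peaks by m/z and call one base per consecutive mass step."""
    ordered = sorted(mz_values)
    calls = []
    for lighter, heavier in zip(ordered, ordered[1:]):
        # "?" marks a step that matches no entry in the table
        calls.append(assign_base(heavier - lighter, tolerance) or "?")
    return "".join(calls)


# Hypothetical ladder built from an arbitrary starting m/z of 500.0
ladder = [500.0, 813.21, 1102.39, 1431.60, 1735.80]
print(read_ladder(ladder))
```

The returned string lists the bases in the order in which residues are added as the fragments grow; how that order maps onto the 5' or 3' direction depends on which ion series the ladder was taken from.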

The w ions together with the protonated molecular ions often allow the
determination of the complete nucleotide sequence. However, when a sequence
contains modified nucleotides, at times the w ions alone are insufficient to
determine the entire sequence, and in these cases the d, a and z ions provide
supplemental information which allows the construction of the complete nucleotide 
sequence. Further the presence of the d, a, and z ions also facilitates the 
determination of the backbone linkage. Thus, for the first time, the sequence and 
backbone linkage of an unknown sequence which includes modified nucleotides can
be determined using a single technique. 

Results of such fragment ordering and molecular weight assignment are
shown in Figures 1, 2, 3, 4, 5, 6 and 7. The results correctly identified the nucleotide
sequence of a 20-mer phosphorothioate oligonucleotide having two 2'-O-methyl
nucleosides at its 5' end and four 2'-O-methyl nucleosides at its 3' end (Figures 1 and 2),
and the nucleotide sequence of a 15-mer phosphorothioate deoxyribonucleotide
(Figure 3). The results correctly identified the nucleotide sequence of a 17-mer
phosphorothioate/phosphodiester (PS/PO) chimeric oligonucleotide (Figure 4). The
results correctly identified the nucleotide sequence of a 25-mer phosphorothioate
oligonucleotide (Figure 5). The results correctly identified the nucleotide sequence of
a 20-mer phosphorothioate oligonucleotide having 2'-O-methylation on the sugar
units of the first two and last four nucleosides (Figure 6). The results correctly
identified the nucleotide sequence of an 18-mer
phosphorothioate/methylphosphonate (PS/MP) chimeric oligonucleotide (Figure 7).
The generality of this method is illustrated by the successful sequencing of four
distinct modified oligonucleotides as representative compounds.

The following examples illustrate certain preferred embodiments of novel 
methods for sequencing of oligonucleotides, in particular modified oligonucleotides. 
Using the method according to the invention, the ISF occurring in the MALDI
process leads to the generation of sequence ions of modified oligonucleotides
allowing a rapid determination of their sequences. While the fragmentation pattern
appears to be little affected by the modification on the ribose ring of a nucleotide, a
greater effect has been observed when modification occurs on the backbone
linkages. In general, "w" ions provide the most extensive sequence information.
The utility of the accompanying "a", "d" and "z" ions varies according to the type of
backbone modification. For PO ODN and MP ODN, useful confirmation or
supplemental information comes from "d" ions. In contrast, "a" ions give the most 
useful supplemental information for PS ODN. Because the sequencing method of 
the invention involves no chemical or enzymatic reaction, it is by far the most 
general sequencing method for small antisense ODNs. Its other advantages include
the ability to obtain information on the modification groups and speed; a sequence 
can be determined in less than 1 minute rather than hours. 

The following examples are intended to further illustrate certain preferred 
embodiments of the invention and are not intended to limit the scope of the 
invention.



Example 1 

Preparation of a matrix suitable for MALDI 
3-hydroxypicolinic acid (3-HPA; Aldrich Chemical Co., Milwaukee, WI; 10.0
+/- 0.5 mg) was weighed and put into a 1.5 ml Eppendorf tube. To the Eppendorf
tube was added 75 microliters HPLC grade water (J.T. Baker) and 75 microliters
HPLC grade acetonitrile (EM Science, Gibbstown, NJ). The tube was capped and
fixed onto the platform of a Vortex Genie 2 (Scientific Industries, Inc. Bohemia, NY)
vortexer, which was operated at vibration level 6 for 5 minutes.
N-(3-indolylacetyl)-L-leucine (IAL; Aldrich; 2.0 +/- 0.4 mg) was weighed and put
into a 1.5 ml Eppendorf tube, to which was added the freshly prepared 3-HPA
solution at such volume as to result in a final IAL concentration of 16 g/L. The
Eppendorf tube containing the 3-HPA/IAL was then vortexed for 1 minute. Cation
exchange resin in ammonium form (Bio-Rad Laboratories, Richmond, CA; 200-400
mesh; 25 +/- 1 mg) was weighed and added to the Eppendorf tube containing the
3-HPA/IAL solution. The tube was then placed in a heating bath at 50°C for 10
minutes. The tube was then removed from the bath and allowed to cool at room
temperature for 2 minutes prior to use. Storage for future use is in darkness.
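
The volume of 3-HPA solution needed to reach the stated IAL concentration of 16 g/L follows from the weighed IAL mass. The short Python sketch below works this out for the nominal 2.0 mg and for the two ends of the stated +/- 0.4 mg range; the function name is hypothetical, and the sketch assumes that dissolving the IAL does not change the solution volume appreciably.

```python
def solvent_volume_ul(solute_mg, target_g_per_l):
    """Volume in microliters giving target_g_per_l for solute_mg of solute."""
    # mg divided by g/L gives mL; multiply by 1000 for microliters
    return solute_mg / target_g_per_l * 1000.0


nominal = solvent_volume_ul(2.0, 16.0)  # nominal IAL mass from the example
low = solvent_volume_ul(1.6, 16.0)      # lower end of the weighing range
high = solvent_volume_ul(2.4, 16.0)     # upper end of the weighing range
print(nominal, low, high)
```

Under these assumptions the nominal case calls for about 125 microliters, and the whole range of about 100 to 150 microliters can be drawn from the roughly 150 microliters of 3-HPA solution prepared above.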



Example 2

Preparation of an exemplary MALDI target
A 20-mer oligonucleotide phosphorothioate having two 5' terminal 2'-O-methyl
nucleosides and four 3' terminal 2'-O-methyl nucleosides was prepared as a
solution in HPLC grade water (Baker) at a concentration of 5000 ppm. Four
microliters of the matrix prepared according to Example 1 was transferred to an
Eppendorf tube, to which two microliters of the oligonucleotide solution was added.
To the tube was then added 3 mg of cation exchange resin in ammonium form
(200-400 mesh), and the tube was then vortexed for 10 seconds. The Eppendorf tube was
allowed to stand briefly, then 1 microliter of the clear solution was withdrawn and
deposited on a stainless steel target (Bruker). The solvent was then allowed to
evaporate at ambient conditions.
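
For orientation, the amount of oligonucleotide that reaches the target in this preparation can be estimated from the stated volumes. The Python sketch below is a rough estimate only: it assumes that 1 ppm corresponds to 1 mg/L for a dilute aqueous solution, that the matrix and analyte volumes simply add, and that the resin removes no analyte. The function name is hypothetical.

```python
def analyte_on_target_ug(conc_ppm, analyte_ul, matrix_ul, spotted_ul):
    """Estimate micrograms of analyte in the spotted aliquot."""
    # Assumption: 1 ppm ~ 1 mg/L = 0.001 ug/uL for a dilute aqueous solution
    total_ug = conc_ppm * 0.001 * analyte_ul
    # Assumption: volumes are additive and the resin takes up no analyte
    return total_ug * spotted_ul / (analyte_ul + matrix_ul)


spotted = analyte_on_target_ug(5000, 2.0, 4.0, 1.0)
print(round(spotted, 2))
```

With the volumes of this example the estimate comes to somewhat under 2 micrograms of oligonucleotide per spot.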



Example 3 

Sequence determination of a 20-mer phosphorothioate oligonucleotide nucleic acid
analyte using MALDI-TOFMS with delayed pulsed ion extraction

The MALDI target prepared according to Example 2 was placed in the 
sample chamber of a Bruker Matrix-assisted Laser Desorption/Ionization
Time-of-Flight Mass Spectrometer equipped to allow delay of pulsed ion extraction
following laser irradiation. The instrument settings were as follows: (1) acceleration
voltage = +25 kV; (2) P2 voltage = +22.5 kV; (3) lens voltage = +9.2 kV; (4) detector
voltage = +1.5 kV; (5) time elapsed between laser firing and voltage switching on 
the auxiliary plate = 250 ns; (6) laser attenuation level = 40-50; (7) number of spectra 
summed = 80; (8) ion deflection voltage and deflection pulse duration: a +2 kV
pulse was maintained for 2 microseconds to obtain a spectrum for "w" ions
comprising N-18 to N for the oligonucleotide (see Figure 1); the same voltage pulse
was maintained for 1 microsecond to obtain a spectrum for the "w" ion N-19 (see 
Figure 2). The MS was externally calibrated with a set of calibration constants 
generated separately from a known compound under the same conditions. 
Molecular weight assignments were made for each peak, from which the predicted
oligonucleotide sequence was deduced.

Further, the modification on the 2' position of the ribose ring by O-methylation
appears to have little effect on backbone fragmentation. When fragmented by
MALDI, the 20-mer PS ODN with six 2'-O-methylated nucleotides in the sequence,
two on the 5'-end and four on the 3'-end, gave four ion series (Figure 6). As in the
case of the 25-mer PS ODN described below in Example 5, "a" and "w" ions are the
dominant sequence ions. Additional fragment ion series include "d" and "z" (see
Tables 1 and 2). Because the modification occurs to the nucleosides, each residue
mass used to assign terminal nucleosides is increased by 30 u to 238 (C), 239 (T), 263
(A), and 278 (G), respectively. Similarly, the residue mass for each nucleotide
containing a 2'-O-methyl group is 335 (C), 350 (T), 359 (A), and 375 (G). The terminal
nucleotides containing a 2'-O-methyl group should have the following m/z in the
positive ion mode: 354 (C), 369 (T), 378 (A), and 394 (G).

Table 1. Measured and theoretical m/z of the "w" ions of the 20-mer all PS ON

ion type obsd. m/z theor. m/z mass diff. base assign.

Table 2. Measured and theoretical m/z of the "a", "d", and "z" ions of the 20-mer
all PS ON

ion type obsd. m/z theor. m/z mass diff. base assign.

Example 4

Sequence determination of a 17-mer PS/PO chimeric nucleic acid analyte
using MALDI-TOFMS with delayed pulsed ion extraction
A 17-mer PS/PO chimeric oligodeoxynucleotide MALDI target was placed in
the sample chamber of a Bruker Matrix-assisted Laser Desorption/Ionization
Time-of-Flight Mass Spectrometer equipped to allow delay of pulsed ion extraction
following laser irradiation. The flight path is 1.6 m. The desorption and ionization
was effected with a 337 nm N2 laser. The irradiance was set at an estimated value
of 1-2 x 10^7 W/cm^2, corresponding to a laser attenuation level of 40-50, by
measuring the laser energy using a power meter. To improve the signal-to-noise
ratio, typically 100 laser shots were summed. Other instrument settings were as
follows: (1) acceleration voltage = +20 kV; (2) P2 voltage = +22.5 kV; (3) lens
voltage = +9.2 kV; (4) detector voltage = +1.5 kV; and (5) time elapsed between laser
firing and voltage switching on the auxiliary plate = 250 ns. Matrix ions were
deflected to avoid saturation of the dual-microchannel-plate detector. For fragment
ions with a mass-to-charge ratio (m/z) less than 1500, the deflection duration was
set at 1000 ns, while for those with m/z above 1500 the deflection duration was set
at 2000 ns. To reduce the systematic error introduced by the use of delayed
extraction, a mass spectrum was calibrated in two segments. In the lower mass
region, a matrix ion peak and the doubly protonated molecular ion were used as the
calibrants. In the higher mass region, the singly and doubly protonated molecular
ions were used as the calibrants.
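
The two-segment calibration described above can be illustrated with a simple model. The Python sketch below is an assumption-laden illustration, not the calibration routine of the instrument: it treats the flight time as an ideal field-free drift over the stated 1.6 m path at the stated 20 kV, adds a constant timing offset to stand in for the systematic error, and fits the common relation t = t0 + k*sqrt(m/z) through two calibrants per segment. The calibrant m/z values, the offset and all function names are hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
DALTON = 1.66053906660e-27  # atomic mass constant, kg


def ideal_flight_time(mz, path_m=1.6, volts=20e3, offset_s=0.0):
    """Idealized drift time in seconds; ignores the delayed-extraction source."""
    velocity = math.sqrt(2.0 * E_CHARGE * volts / (mz * DALTON))
    return path_m / velocity + offset_s


def two_point_calibration(cal1, cal2):
    """Fit t = t0 + k*sqrt(m/z) through two (m/z, time) calibrants."""
    (mz1, t1), (mz2, t2) = cal1, cal2
    k = (t2 - t1) / (math.sqrt(mz2) - math.sqrt(mz1))
    t0 = t1 - k * math.sqrt(mz1)
    return lambda t: ((t - t0) / k) ** 2


# Hypothetical calibrants: a matrix ion, (M+2H)2+ and (M+H)+
MATRIX_ION, M2H, MH = 140.0, 2700.5, 5400.0
OFFSET = 250e-9  # hypothetical constant timing offset, s


def measured_time(mz):
    """Simulated arrival time for an ion of the given m/z."""
    return ideal_flight_time(mz, offset_s=OFFSET)


low_cal = two_point_calibration((MATRIX_ION, measured_time(MATRIX_ION)),
                                (M2H, measured_time(M2H)))
high_cal = two_point_calibration((M2H, measured_time(M2H)),
                                 (MH, measured_time(MH)))


def calibrated_mz(time_s):
    """Use the lower segment below the (M+2H)2+ peak, the upper one above."""
    if time_s < measured_time(M2H):
        return low_cal(time_s)
    return high_cal(time_s)


print(round(calibrated_mz(measured_time(1500.0)), 3))
```

In this idealized model an ion of m/z 1500 drifts for roughly 30 microseconds. Because the simulated times follow the fitted relation exactly, both segments recover the input m/z; on real data the two segments would generally give slightly different constants, which is the reason for calibrating them separately.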

Figure 4 shows a mass spectrum of a 17-mer oligodeoxynucleotide consisting
of both PS and PO nucleotides. Prominent ion series observable in the mass
spectrum include the "d" and "w" ion series. Also prominently present in the lower
m/z region of the mass spectrum are the "a" ions (Table 3). Because the
measurement was taken in positive ion mode, all ions should have two extra
protons compared to their negative ion analogs, assuming they are all even-electron
species. For simplicity, the same notation is used for the same ion type regardless
of the ion polarity. The presence of a complete set of "w" ions (w1-w16) provides for
ready sequence determination. Confirmation of the 17-mer sequence can be made
by a simple comparison between the expected and experimentally determined m/z
of the "w" ions (Table 3).

The presence of prominent "a" ions in a MALDI-TOF mass spectrum is a
feature that has not been reported for mixed base ODNs before. Equally unusual is
the presence of some of the a(n)-B(T) ions (the type and sequence position of the
cleaved nucleobase are denoted by the letter in parentheses and the integer n).
Although a(n)-B ions are commonly observed in a MALDI-TOF mass spectrum of
ODNs, the loss of thymine has been generally considered an unfavored fragmentation
pathway (Hillenkamp, Smith). Noticeably missing are the "y" ions, except for y14,
y15, and y16. These three "y" ions are likely residual failure sequences that remain
because of the limited efficiency of RP-HPLC in ODN purification.

A sequence can be constructed from the "w" ions together with the (M+H)+ ion.
The sequence is then compared with segments of sequences constructed from other
ion series for verification. In general, this is a more useful strategy because it can be
used to determine an unknown sequence. In such an approach, the first step of the
sequence determination is to find the w(k-1) peak (k being the total number of
nucleotides in a given sequence). Because the difference in m/z between (M+H)+
and w(k-1) corresponds to the mass of a 5'-terminal nucleoside residue, it should
match one of the five possible values for unmodified deoxynucleosides: 209 (C), 210
(U), 224 (T), 233 (A), and 249 (G). The m/z difference between two adjacent "w"
ions, on the other hand, should correspond to the mass of a nucleotide residue.
Thus w(k-2) is located by finding the peak whose m/z is smaller than that of w(k-1)
by the mass of one nucleotide residue. For phosphodiester nucleotides the residue
masses are 289 (C), 304 (T), 313 (A), and 329 (G). These become 305 (C), 320 (T),
329 (A), and 345 (G) for phosphorothioate. The remaining "w" ions
except w1 can be determined successively in this way. Being a terminal ion, w1 has a
mass larger by that of H2O than the corresponding residue mass of a repeating unit.
An additional correction for the ion polarity requires the addition of one proton in
the positive ion mode. Therefore, a PS w1 ion can have the following masses: 324 (C),
339 (T), 348 (A), and 364 (G); a PO w1 ion has a mass smaller by 16 u.
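
The ladder-reading strategy described above can be sketched in a few lines. This is an illustrative outline only: it uses the nominal masses quoted in the text (with 233 for the A nucleoside, i.e. the 313 residue mass less 80 for the phosphate group), a hypothetical tolerance of 0.5, and an invented all-PS 4-mer whose m/z values were built from those same masses rather than taken from any measured spectrum. All function and variable names are assumptions.

```python
# Nominal masses quoted in the text (positive ion mode).
NUCLEOSIDE_5P = {"C": 209, "U": 210, "T": 224, "A": 233, "G": 249}  # A = 313 - 80
PO_RESIDUE = {"C": 289, "T": 304, "A": 313, "G": 329}
PS_SHIFT = 16  # a PS residue is 16 heavier than the PO residue
W1_PS = {"C": 324, "T": 339, "A": 348, "G": 364}


def _labelled(po_masses):
    """Expand a PO mass table into labelled PO and PS entries."""
    table = {}
    for base, mass in po_masses.items():
        table[f"{base}(PO)"] = mass
        table[f"{base}(PS)"] = mass + PS_SHIFT
    return table


INTERNAL = _labelled(PO_RESIDUE)
W1 = _labelled({base: mass - PS_SHIFT for base, mass in W1_PS.items()})


def candidates(value, table, tol=0.5):
    """Return every label whose tabulated mass lies within tol of value."""
    return [label for label, mass in table.items() if abs(value - mass) <= tol]


def read_w_ladder(mh, w_desc, tol=0.5):
    """Read a sequence 5' -> 3' from (M+H)+ and the w ions.

    w_desc holds the m/z of w(k-1), w(k-2), ..., w1 in descending order.
    Each position gets a list of candidates, because some mass
    differences are shared (329 fits both A(PS) and G(PO)).
    """
    calls = [candidates(mh - w_desc[0], NUCLEOSIDE_5P, tol)]
    for upper, lower in zip(w_desc, w_desc[1:]):
        calls.append(candidates(upper - lower, INTERNAL, tol))
    calls.append(candidates(w_desc[-1], W1, tol))
    return calls


# Hypothetical 4-mer: (M+H)+ = 1222, w3 = 998, w2 = 653, w1 = 324.
calls = read_w_ladder(1222, [998, 653, 324])
```

Positions that return more than one candidate, such as a difference of 329, need another ion series or knowledge of the backbone to be resolved, which is consistent with the cross-checking against other ion series recommended in the text.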
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For the 17-mer in Figure 4, although sequence determination could be 
achieved with the "w" ion series alone, other types of ions provide useful
confirmation. Because the use of high laser irradiance caused peak broadening even
with the use of delayed extraction, the w13 ion peak at m/z 4212 is not well
separated from the peak at m/z 4205. The w12 peak at m/z 3910 is also weak. This
affects the assignment of nucleotides at positions 4, 5, and 6 (counting from the 5'
end). However, information can be unambiguously obtained from d3 (m/z 979), d4
(m/z 1308), d5 (m/z 1613), and d6 (m/z 1942). As shown in Table 3, the partial
sequence constructed from these ions matches the corresponding sequence constructed
from "w" ions. Because w1 appears in the region where matrix ions are very
abundant, there may be cases when the former is overwhelmed by the latter. In
such a case, the "d" ions could give the identity of the 3' terminal nucleoside, as
the "w" ions do for the 5' terminal nucleoside. As an alternative to the "d" ions, the
"a" ions could also give unambiguous sequence assignment for positions 4-6 when
"w" ions could not. The presence of both "a" and "d" ions could also be very useful in
determining the structure of the backbone linkage. When a backbone linkage in
question is flanked by both "a" and "d" ion peaks, its identity can be revealed from
the m/z difference of the two types of ions. In such a case, an m/z difference of 98
indicates a PO linkage, while that of 114 indicates a PS linkage.
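
The linkage test in the last two sentences reduces to a simple lookup, sketched below. The function name, the tolerance, and the m/z pairs in the usage lines are hypothetical; selecting which "a" and "d" peaks flank the linkage in question is left to the caller.

```python
# Expected m/z difference between the flanking "a" and "d" ions, from the text.
AD_DIFFERENCE = {"PO": 98, "PS": 114}


def linkage_from_a_d(a_mz, d_mz, tol=1.0):
    """Return "PO" or "PS" for a flanking a/d ion pair, or None if neither fits.

    The absolute difference is used, so the order of the two
    arguments does not matter.
    """
    diff = abs(d_mz - a_mz)
    for linkage, expected in AD_DIFFERENCE.items():
        if abs(diff - expected) <= tol:
            return linkage
    return None


# Hypothetical peak pairs, not taken from Table 3.
first = linkage_from_a_d(1000.0, 1098.0)   # difference of 98
second = linkage_from_a_d(1000.0, 1114.0)  # difference of 114
```

Because the two expected differences are 16 apart, a tolerance of about 1 is enough to keep them distinct at nominal mass accuracy.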
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Table 3. Measured and theoretical m/z of the sequence ions from the PS-PO 
chimera 

ion type obsd. m/z theor. m/z mass diff. base assign 
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Example 5 

Sequence determination of a 25-mer PS oligodeoxynucleotide using
MALDI-TOFMS with delayed pulsed ion extraction.

A 25-mer PS oligodeoxynucleotide was sequenced by MALDI-TOFMS using 
the instrument settings described in Example 4 above. Figure 5 shows a mass 
spectrum of a 25-mer PS ODN. A complete set of "w" ions can be identified (Table 4). 
Accompanying the "w" ions are the "a" ions, giving a distinct pattern of paired peaks
in the mass spectrum (Table 5). Other ion series with generally lower abundances,
such as "b", "c", "d", and "z" ions, have also been found (Table 6). As shown, the
replacement of the non-bridging oxygen atom by a sulfur atom enhances the
generation of "a" ions. While only a partial "a" ion series is observable in the PS/PO
chimera described in Example 4, an almost complete "a" ion series (a2-a24) is
observed here. The a1 ion could not be identified because of the presence of an
abundant matrix ion peak at the same m/z position in the spectrum. Besides the
prominence of the "a" ions, another feature that appears to be general for PS ODNs is
that the peaks of "w" ions are larger than those of "a" ions of the same length. Also
noteworthy is the presence of V ion series hitherto unreported in the MALDI 
TOFMS literature. Interestingly, "a-B" ions are missing. Following the strategy
discussed in the previous section, one complete and three partial sequences can be
constructed based on the four types of ion series. The sequences are consistent with 
one another. 
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Table 4. Measured and theoretical m/z of the "w" ions of the 25-mer all PS ODN
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Table 5. Measured and theoretical m/z of the "a" ions from the 25-mer all PS ODN
ion type obsd. m/z theor. m/z mass diff. base assign. 
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Table 6. Measured and theoretical m/z of the "b", "c", "d", and "z" ions of
the 25-mer all PS ODN 
ion type obsd. m/z theor. m/z mass diff. base assign. 
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Example 6 

Sequence determination of an 18-mer PS/MP chimeric oligodeoxynucleotide
using MALDI-TOFMS with delayed pulsed ion extraction.

An 18-mer PS/MP chimeric oligodeoxynucleotide was sequenced by MALDI-TOFMS
using the instrument settings described in Example 4 above. Besides
phosphorothioate, another popular modified backbone structure is
methylphosphonate. The substitution of the non-bridging oxygen atom of ODNs by a
methyl group, as opposed to a sulfur atom, reduces the fragmentation efficiency (Figure
7). A gap appears in the methylphosphonate region. This proves to be extremely 
valuable as they fill the sequence gap. The residue masses for methylphosphonate 
deoxynucleotide are 287 (C), 302 (T), 311 (A), and 327 (G). 
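
With three backbone types in play, it can help to check which residue masses coincide or nearly coincide, since that sets the mass accuracy needed to read a ladder. The short sketch below tabulates the nominal residue masses quoted in Examples 4 and 6 and reports exact collisions and the smallest gap between distinct values. The helper names are hypothetical, and nominal masses are a simplification of what a real spectrum would show.

```python
from itertools import combinations

# Nominal nucleotide residue masses quoted in the text.
RESIDUE_MASS = {
    "PO": {"C": 289, "T": 304, "A": 313, "G": 329},
    "PS": {"C": 305, "T": 320, "A": 329, "G": 345},
    "MP": {"C": 287, "T": 302, "A": 311, "G": 327},
}

# Flatten to (mass, base, backbone) tuples.
ENTRIES = [
    (mass, base, backbone)
    for backbone, table in RESIDUE_MASS.items()
    for base, mass in table.items()
]


def collisions(entries):
    """Pairs of different residues that share the same nominal mass."""
    return [(a, b) for a, b in combinations(sorted(entries), 2) if a[0] == b[0]]


def smallest_gap(entries):
    """Smallest difference between two distinct nominal masses."""
    masses = sorted({mass for mass, _, _ in entries})
    return min(hi - lo for lo, hi in zip(masses, masses[1:]))


same_mass = collisions(ENTRIES)
min_gap = smallest_gap(ENTRIES)
```

On these values each MP residue is 2 lighter than its PO counterpart, A(PS) and G(PO) share a mass of 329, and the closest distinct pair, T(PO) at 304 and C(PS) at 305, is only 1 apart.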

Example 7

To investigate the origin of the in-source fragmentation products, the 25-mer PS
ODN was mixed with a 17-mer PS ODN of the same sequence nested on the 3'-end in
a molar ratio of 100:1. When analyzed by RP-HPLC, the two PS ODNs appear as two
separate components (Figure 8). Fractionation of the mixture at the time points
indicated in the chromatogram gave 5 fractions. Analysis by MALDI-TOFMS revealed
that Fraction A consisted mostly of the 17-mer. Due to the low concentration, the
signal of the 17-mer is low (data not shown). The MALDI-TOFMS mass spectra of the
other fractions are shown in Figure 9. In Fraction B, no accompanying fragments are
present besides the 17-mer. As expected, the later-eluting fractions
contain decreasing amounts of the 17-mer but increasing amounts of the 25-mer.
Importantly, none of these fractions showed the presence of the fragment ions. In
contrast, Fraction E, having the highest concentration of the 25-mer, gave all the
fragment ions (Fig. 10). Note that the 17-mer is not present in the spectrum. If the
fragments were present prior to MALDI, Fraction B would be expected to contain not
only the 17-mer, but also some of the fragments. Conversely, if the 17-mer could be
removed from Fraction E, the fragments should also have been removed from the
25-mer if they had been present. We can therefore conclude that the fragment ions
did not originate from species that are present prior to MALDI; rather, they are the
result of MALDI.
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Example 8
The influence of experimental conditions 
No qualitative difference has been observed when the 25-mer PS ODN,
shown in Figure 5, was analyzed in either the positive or negative polarity, using
3-HPA/IAL as the matrix. As in the positive polarity, the dominant ion series are "w"
and "a" ions. Additional ion series include "d" and V ion, the latter appear only in 
the lower m/z region. The tendency to arc in the ion source at high laser irradiance
makes the positive polarity a better choice for routine analysis.

Although the signal-to-noise ratio (S/N) of the fast fragment ions can be
improved as the irradiance increases from the threshold irradiance, which is defined
as the irradiance necessary to give an S/N of 2 for a given fragment ion, the
abundance of the fragment ions relative to that of the (M+H)+ ion of the 17-mer PS
ODN did not scale linearly with the irradiance applied at the sample. At 6 x 10^6
W/cm^2, only "w3" - "w12" were observed. Increasing the irradiance by 30% made all
"w" ions and most of the "a" ions observable. However, from this point on, a further
increase in irradiance actually decreased the relative abundance of the smaller "w" ions
(smaller than w10), while having little effect on that of the larger "w" ions.

The delay time affects the absolute intensity of all ions. However, the
abundance of fragment ions relative to that of (M+H)+ remained little changed
within 200 - 400 ns.



Example 9

Proposed fragmentation pathway for modified oligonucleotides

Because "a-B" ions of ODNs are usually present in a MALDI-TOF mass
spectrum, most of the ion fragmentation mechanisms which have been proposed
involve the cleavage of the base, followed by the cleavage of the 3' glycosidic bond.
However, using the method according to the invention, a PS ODN could give
prominent depurinated products while showing little sign of "a-B" ions. The
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observation suggests that there is another fragmentation pathway that is 
independent of the base cleavage. The m/z values of the "a" and "w" ions suggest
that the cleavage involves at least one hydrogen transfer. The transfer could occur
directly from the "a" precursor to the "w" precursor or involve the matrix species, as
the cleavage happens in an area where matrix ions and radicals are present.
However, the observation that PS ODNs have a greater tendency than PO or MP
ODNs to form "w" and "a" ions does not favor a cleavage mechanism involving
matrix species. Instead, the observation is better explained by a mechanism
involving El type syn elimination (Lowry, T. H.; Richardson, K. S. Mechanism and
Theory in Organic Chemistry, Harper & Row: New York, 1981, pp 530-556). In this
mechanism, the 4' hydrogen is transferred to the negatively charged non-bridging
oxygen or sulfur atom, resulting in the formation of a double bond between the 3' and 4'
positions and the cleavage of the glycosidic bond (Figure 10). The replacement of
the non-bridging oxygen with a neutral methyl group would impede the process.
In such a case, the 5' PM group could orient itself to participate in the elimination of
the 4' hydrogen, leading to the formation of a double bond between the 4' and 5'
positions or a rearranged sugar ring (Figure 11).
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What is claimed is: 

1. A method for determining the nucleotide sequence of a nucleic acid analyte, 
the method comprising: 

a) providing a suitable target for matrix-assisted laser
desorption/ionization, such target comprising a nucleic acid analyte in a
matrix suitable for such matrix-assisted laser desorption/ionization;

b) irradiating the target with laser irradiation suitable for creating
in-source fragmentation products from the nucleic acid analyte;

c) delaying pulsed ion extraction for a time sufficient to increase mass 
resolution of the in-source fragmentation products and to reduce formation of 
PSD ion fragments; 

d) ordering the in-source fragmentation products according to molecular 
weight by time-of-flight mass spectrometry; and 

e) determining the nucleotide sequence of the nucleic acid analyte by 
comparing the molecular weights of the in-source fragmentation products. 

2. The method of claim 1 wherein said nucleic acid analyte further comprises a
naturally occurring or synthetic oligonucleotide or polynucleotide.

3. The method of claim 2 wherein said oligonucleotide further comprises an 
oligonucleotide having at least one chemically modified internucleoside linkage, 
chemically modified sugar backbone or chemically modified nucleoside base. 
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Figure 12 
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